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1 <110> APPLICANT: University of Illinois at C&^iQAj - 

2 <120> TITLE OF INVENTION: HUMAN HEMATOPOIETIC A • / * 

3 <130> FILE REFERENCE: MBHB : CU08/PPA<160> 17 <170> Patentln jCQ Ojjjf ^ 

ERRORED SEQUENCES _ "7H * - ^/K , I ^ / 

RECEIVED 703 m ^ 

5 <210> SEQ ID NO: 1<211> 2328<212> DNA<213> H(S^sa$iens<220><221> CDS<222> 
E--> 6 <211> LENGTH: 

E--> 7 <212> TYPE: ^/Stf 0 8 200Z 

W--> 8 <213> ORGANISM: 

9 <223> OTHER INFORMATION: Human Hiwi Protein . ArM -rpn ^0^/0900 
W-> 0 <160> NUMBER OF. SEQ ID NOS: TECH CEN -™ IGCJ/iAW 

E--> 10 <400> SEQUENCE: 1 

11 atg ate ttt ggt gtg aac aca agg cag aac tta gac cat gtt aaa gaa 48 

12 Met lie Phe Gly Val Asn Thr Arg Gin Asn Leu Asp His Val Lys Glu 

13 1 5 10 15 

14 tea aaa aca ggt tct tea ggc att ata gta agg tta age act aac cat 96 

15 . Ser Lys Thr Gly Ser Ser Gly He He Val Arg Leu Ser Thr Asn His 

16 20 25 30 

17 ttc egg ctg aca tec egt cec cag tgg gee tta tat cag tat cac att 144 

18 Phe Arg Leu Thr Ser Arg Pro Gin Trp Ala Leu Tyr Gin Tyr His He 

19 35 40 45 

20 gac tat aac eca ctg atg gaa gee aga aga etc egt tea get ctt ctt 192 

21 Asp Tyr Asn Pro Leu Met Glu Ala Arg Arg Leu Arg Ser Ala Leu Leu 

22 50 55 60 

23 ttt caa cac gaa gat eta att gga aag tgt cat get ttt gat gga acg 240 

24 Phe Gin His Glu Asp Leu He Gly Lys Cys His Ala Phe Asp Gly Thr 

25 65 70 75 " 80 

26 ata tta ttt tta cct aaa aga eta cag caa aag gtt act gaa gtt ttt 288 

27 He Leu Phe Leu Pro Lys Arg Leu Gin Gin Lys Val Thr Glu Val Phe 

28 85 90 95 

29 agt aag acc egg aat gga gag gat gtg agg ata acg ate act tta aca 336 

30 Ser Lys Thr Arg Asn Gly Glu Asp Val Arg He Thr He Thr Leu Thr 

31 100 105 110 

32 aat gaa ctt cca cct aca tea eca act tgt ttg cag ttc tat aat att 384 

33 Asn Glu Leu Pro Pro Thr Ser Pro Thr Cys Leu Gin Phe Tyr Asn He 

34 115 120 125 

35 att ttc agg agg ctt ttg aaa ate atg aat ttg caa caa att gga cga 432 

36 He Phe Arg Arg Leu Leu Lys lie Met Asn Leu Gin Gin He Gly Arg 

37 130 135 140 

38 aat tat tat aac cca aat gac cca att gat att cca agt cac agg ttg 480 

39 Asn Tyr Tyr Asn Pro Asn Asp Pro He Asp He Pro Ser His Arg Leu 
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261 770 775 

263 <210> SEQ ID NO: 3<211> 24<212> DNA<213> Artificial Sequence<220><223> Glyceraldehyde 
W--> 264 phosphate dehydrogenase cDNA forward primer 
E--> 265 <211> LENGTH: 
E--> 266 <212> TYPE: 
W--> 267 <213> ORGANISM: 

268 <400> SEQUENCE: 3 

269 ggctgagaac gggaagcttg teat 24 

271 <210> SEQ ID NO: 4<211> 24<212> DNA<213> Artificial Sequence<220><223> Glyceraldehyde 
W--> 272 phosphate dehydrogenase cDNA reverse primer 
E--> 273 <211> LENGTH: 
E--> 274 <212> TYPE: 
W--> 275 <213> ORGANISM: 

276 <400> SEQUENCE: 4 

277 cagccttctc catggtggtg aaga 24 
279 <210> SEQ ID NO: 5<211> 20<212> DNA<213> Artificial Sequence<220><223> B2 

W--> 280 microglobulin forward primer 
E--> 281 <211> LENGTH: 
E--> 282 <212> TYPE: 
W--> 283 <213> ORGANISM: 

284 <400> SEQUENCE: 5 

285 ctcgcgctac tctctctttc 20 
287 <210> SEQ ID NO: 6<211> 21<212> DNA<213> Artificial Sequence<220><223> B2 

W--> 288 microglobulin reverse primer 
E--> 289 <211> LENGTH: 
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E--> 290 <212> TYPE: 
W--> 291 <213> ORGANISM: 

292 <400> SEQUENCE: 6 

293 catgtctcga tcccacttaa c 21 

295 <210> SEQ ID NO: 7<211> 22<212> DNA<213> Artificial Sequence<220><223> CD34 hiwi DNA 
W--> 296 forward primer 
E--> 297 <211> LENGTH: 
E--> 298 <212> TYPE: 
W--> 299 <213> ORGANISM: 

300 <400> SEQUENCE: 7 

301 gaagcagcct gtcttggtca gc 22 

303 <210> SEQ ID NO: 8<211> 25<212> DNA<213> Artificial Sequence<220><223> CD34 hiwi DNA 
W--> 304 reverse primer 
E .-> 305 <211> LENGTH: 
E--> 306 <212> TYPE: 
W--> 307 <213> ORGANISM: 

308 <400> SEQUENCE: 8 

309 gaatcaaagc tcaaacccca gtctc 25 

311 <210> SEQ ID NO: 9<211> 25<212> DNA<213> Artificial Sequence<220><223> human testis 
W--> 312 hiwi gene reverse primer 
E--> 313 <211> LENGTH: 
E--> 314 <212> TYPE: 
W--> 315 <213> ORGANISM: 

316 <400> SEQUENCE: 9 

317 cgctgtatgt ggtctggctt caggc 25 

319 <210> SEQ ID NO: 10<211> 30<212> DNA<213> Artificial Sequence<220><223> human testis 



SEQUENCE: 10 

gggagaaaca ctaccacttc tcacagcctg 30 
SEQ ID NO: 11<211> 27<212> DNA<213> Artificial Sequence<220><223> Marathon 



w- 


-> 


320 


hiwi < 


E- 


-> 


321 


<211> 


E- 


-> 


322 


<212> 


W- 


-> 


323 


<213> 






324 


<400> 






325 








327 


<210> 


Ready 




W- 


-> 


328 


Human 


E- 


-> 


329 


<211> 


E- 


-> 


330 


<212> 


W- 


-> 


331 


<213> 






332 


<400> 






333 








335 


<210> 


Ready 




W- 


-> 


336 


Human 


E- 


-> 


337 
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E- 


-> 
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<212> 


W- 


-> 
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<213> 






340 


<400> 






341 








343 


<210> 


E- 


-> 


344 


<211> 


E- 


-> 


345 


<212> 



SEQUENCE: 11 

ccatcctaat acgactcact atagggc 2 ? 
SEQ ID NO: 12<211> 23<212> DNA<213> Artificial Sequence<220><223> Marathon 



SEQUENCE: 12 

actcactata gggctcgagc ggc 23 
SEQ ID NO: 13<211> 844<212> PRT<213> Drosophila melanogaster<220><221> 
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W--> 346 <213> ORGANISM: 



347 <223> OTHER INFORMATION: PIWI protein 
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424 






595 










600 










605 








425 


Leu 


Met 


Thr 


He 


Glv 


Phe 


Asp 


He 


Ala 


Lys 


Ser 


Thr 


Arg 


Asp 


Ara 


Lys 


426 




610 










615 










620 










427 


A rcr 


Ala 


Tvr 

y 


Glv 


Ala 


Leu 


He 


Ala 


Ser 


Met 


Asp 


Leu 


Gin 


Gin 


Asn 


Ser 


428 


625 










630 










635 










640 


429 


Thr 


Tvr 


Phe 


Ser 


Thr 


Val 


Thr 


Glu 


Cys 


Ser 


Ala 


Phe 


Asp 


Val 


Leu 


Ala 


430 










645 










650 










655 




431 


Asn 


Thr 


Leu 


TrD 


Pro 


Met 


He 


Ala 


Lvs 


Ala 


Leu 


Ara 


Gin 


Tvr 


Gin 


His 


432 








660 










665 










670 






433 


Glu 


His 


Arg 


Lys 


Leu 


Pro 


Ser 


Arg 


He 


Val 


Phe 


Tyr 


Arg 


Asp 


Gly 


Val 


434 






675 










680 










685 








435 


Ser 


Ser 


Gly 


Ser 


Leu 


Lys 


Gin 


Leu 


Phe 


Glu 


Phe 


Glu 


Val 


Lys 


Asp 


lie 


436 




690 










695 










700 










437 


He 


Glu 


Lys 


Leu 


Lys 


Thr 


Glu 


Tyr 


Ala 


Arg 


Val 


Gin 


Leu 


Ser 


Pro 


Pro 


438 


705 










710 










715 










720 


439 


Gin 


Leu 


Ala 


Tyr 


lie 


Val 


Val 


Thr 


Arg 


Ser 


Met 


Asn 


Thr 


Arg 


Phe 


Phe 


440 










725 










730 










735 




441 


Leu 


Asn 


Gly 


Gin 


Asn 


Pro 


Pro 


Pro 


Gly 


Thr 


He 


Val 


Asp 


Asp 


Val 


lie 


442 








740 










745 










750 






443 


Thr 


Leu 


Pro 


Glu 


Arg 


Tyr 


Asp 


Phe 


Tyr 


Leu 


Val 


Ser 


Gin 


Gin 


Val 


Arg 
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RAW SEQUENCE LISTING DATE: 04/25/2002 

PATENT APPLICATION: US/10/04 3 , 774A TIME: 14:31:10 

Input Set : N:\Crf3\04102002\J043774.raw 
Output Set: N:\CRF3\04252002\J043774A.raw 



444 






755 










760 






765 








445 


Gin 


Gly 


Thr 


Val 


Ser 


Pro 


Thr 


Ser 


Tyr Asn Val 


Leu 


Tyr 


Ser 


Ser 


Met 


446 




770 










775 






780 










447 


Gly 


Leu 


Ser 


Pro 


Glu 


Lys 


Met 


Gin 


Lys Leu Thr 


Tyr 


Lys 


Met 


Cys 


His 


448 


785 










790 






795 










800 


449 


Leu 


Tyr 


Tyr 


Asn 


Trp 


Ser 


Gly 


Thr 


Thr Arg Val 


Pro 


Ala 


Val 


Cys 


Gin 


450 










805 








810 








815 




451 


Tyr 


Ala 


Lys 


Lys 


Leu 


Ala 


Thr 


Leu 


Val Gly Thr 


Asn 


Leu 


His 


Ser 


He 


452 








820 










825 






830 






453 


Pro 


Gin 


Asn 


Ala 


Leu 


Glu 


Lys 


Lys 


Phe Tyr Tyr 


Leu 










454 






835 










840 















456 <210> SEQ ID NO: 14<211> 29<212> DNA<213> Artificial Sequence<220><223> CD34 
W--> 457 hematopoietic cell cDNA forward primer 
E--> 458 <211> LENGTH: 
E--> 459 <212> TYPE: 
W--> 460 <213> ORGANISM: 

461 <400> SEQUENCE: 14 

462 atgatctttg gtgtgaacac aaggcagaa 29 
464 <210> SEQ ID NO: 15<211> 31<212> DNA<213> Artificial Sequence<220><223> CD34 

W--> 4 65 hematopoietic cell cDNA reverse primer 
E--> 466 <211> LENGTH: 
E--> 467 <212> TYPE: 
W--> 468 <213> ORGANISM: 

469 <400> SEQUENCE: 15 

470 gaggtagtaa aggcggtttg acagtgacag a 31 
472 <210> SEQ ID NO: 16<211> 24<212> DNA<213> Artificial Sequence<220><223> human hiwi 

W--> 4 73 gene forward primer 
E--> 474 <211> LENGTH: 
E--> 475 <212> TYPE: 
W--> 476 <213> ORGANISM: 

477 <400> SEQUENCE: 16 

478 ccttgccagt acgcccacaa gctg 24 

480 <210> SEQ ID NO: 17<211> 27<212> DNA<213> Artificial Sequence<220><223> human hiwi 
W--> 4 81 gene reverse primer 
E--> 482 <211> LENGTH: 
E--> 483 <212> TYPE: 
W--> 484 <213> ORGANISM: 

485 <400> SEQUENCE: 17 

486 ccccacctat ggttgtagtg agcatcc 27 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/04 3 , 774A 



DATE: 04/25/2002 
TIME: 14:31:11 



Input Set : N:\Crf3\04102002\J043774.raw 
Output Set: N:\CRF3\04252002\J04 3774A.raw 



Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:2; Line(s) 263 
Seq#:3; Line(s) 271 
Seq#:6; Line(s) 295 
Seq#:7; Line(s) 303 
Seq#:8; Line(s) 311 
Seq#:9; Line(s) 319 
Seq#:10; Line(s) 327 
Seq#:ll; Line(s) 335 
Seq#:15; Line(s) 472 
Seq#:16; Line(s) 480 
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VERIFICATION SUMMARY DATE: 04/25/2002 

PATIENT APPLICATION: US/10/04 3 , 774A TIME: 14:31:11 

Input Set : N:\Crf3\04102002\J043774.raw 
Output Set: N:\CRF3\04252002\J043774A.raw 

L : 6 M : 210 E : (40) Invalid Number of Sequences, LENGTH: 

L:7 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:8 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:0 M:282 W: Numeric Field Identifier Missing, <160> is required. 

L:10 M:200 E: Mandatory Header Field missing, <220> not found for SEQ ID# : 1 

L:160 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:161 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L-162 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:163 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS : 2 differs: 

L:264 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:265 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:266 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:267 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:272 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:273 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:274 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:275 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:280 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:281 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:282 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:283 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:288 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:289 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:290 M : 310 E : (3) Wrong or Missing Sequence Type, TYPE: 

L:291 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:296 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:297 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:298 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:299 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:304 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:305 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:306 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:307 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:312 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:313 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L : 314 M : 310 E : (3) Wrong or Missing Sequence Type, TYPE: 

L:315 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:320 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:321 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:322 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:323 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:328 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:329 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:330 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:331 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:336 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L : 337 M : 210 E : (40) Invalid Number of Sequences, LENGTH: 

L:338 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/043 , 774A 



DATE: 04/25/2002 
TIME: 14:31:11 



Input Set : N:\Crf3\04102002\J043774.raw 
Output Set: N:\CRF3\04252002\J043774A.raw 



L:339 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:344 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:345 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:346 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:348 M:200 E: Mandatory Header Field missing, <220> not found for SEQ ID#:13 

L:457 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:458 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:459 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:460 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:465 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:466 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:467 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:468 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:473 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:474 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:475 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:476 M:201 W: Mandatory field data missing, <213> ORGANISM 

L:481 M:259 W: Allowed number of lines exceeded, <210> SEQ ID NO 

L:482 M:210 E: (40) Invalid Number of Sequences, LENGTH: 

L:483 M:310 E: (3) Wrong or Missing Sequence Type, TYPE: 

L:484 M:201 W: Mandatory field data missing, <213> ORGANISM 

L-0 M:203 E: No. of Seq . differs, <160> Number Of Sequences : Input (0) Counted (17) 
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STATISTICS SUMMARY DATE: 04/25/2002 

PATENT APPLICATION: US/10/043 , 774A - TIME: 14:31:11 



Input Set : N;\Crf 3\04102002\J043774 . raw 
Output Set: N:\CRF3\04252002\J04 3774A.raw 

Application Serial Number: US/10/043 , 774A 

Alpha or Numeric: Numeric 

Application Class: 

Application File Date: 03-29-2002 

Art Unit: OIPE 

Software Application: 

Total Number of Sequences: 17 

Total Nucleotides: 0 

Total Amino Acids: 0 

Number of Errors: 38 

Number of Warnings: 32 

Number of Corrections : 0 

MESSAGE SUMMARY 

200 E : 2 (Mandatory Header Field missing) 

201 W: 17 (Mandatory field data missing) 
203 E: 1 (No. of Seq . differs) 

210 E: 17 ((40) Invalid Number of Sequences) 

212 E: 1 ((34) Invalid or duplicate Sequence ID Number) 

259 W: 14 (Allowed number of lines exceeded) 

282 W: 1 (Numeric Field Identifier Missing) 

310 E: 17 ((3) Wrong or Missing Sequence Type) 
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